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ABSTRACT 

It is widely accepted that observations at mid-infrared (mid-IR) wavelengths enable the selection 
of galaxies with nuclear activity, which may not be revealed even in the deepest X-ray surveys. 
Many mid-IR color-color criteria have been explored to accomplish this goal and tested thoroughly 
in the literature. Besides missing many low-luminosity active galactic nuclei (AGN), one of the 
main conclusions is that, with increasing redshift, the contamination by non-active galaxies becomes 
significant (especially at z > 2.5). This is problematic for the study of the AGN phenomenon in the 
early Universe, the main goal of many of the current and future deep extra-galactic surveys. In this 
work new near- and mid-IR color diagnostics are explored, aiming for improved efficiency — better 
completeness and less contamination — in selecting AGN out to very high redshifts. We restrict our 
study to the James Webb Space Telescope wavelength range (0.6-27/im). The criteria are created 
based on the predictions by state-of-the-art galaxy and AGN templates covering a wide variety of 
galaxy properties, and tested against control samples with deep multi-wavelength coverage (ranging 
from the X-rays to radio frequencies). We show that the colors K s — [4.5], [4.5] — [8.0], and [8.0] — [24] 
are ideal as AGN/non-AGN diagnostics at, respectively, z < 1, 1 < z < 2.5, and z > 2.5 — 3. 
However, when the source redshift is unknown, these colors should be combined. We thus develop an 
improved IR criterion (using K and IRAC bands, KI) as a new alternative at z < 2.5. KI does not 
show improved completeness (50-60% overall) in comparison to commonly used IRAC-based AGN 
criteria, but is less affected by non-AGN contamination (revealing a > 50-90% level of successful AGN 
selection). We also propose KIM (using K, IRAC, and MIPS-24/xm bands, KIM), which aims to select 
AGN hosts from local distances to as far back as the end of reionization (0 < z < 7) with reduced 
non-AGN contamination. However, the necessary testing-constraints and the small control-sample 
sizes prevent the confirmation of its improved efficiency at z > 2.5. Overall, KIM shows a ~30-40% 
completeness and a > 70-90% level of successful AGN selection. KI and KIM are built to be reliable 
against a ^10-20% error in flux, are based on existing filters, and are suitable for immediate use. 
Subject headings: galaxies: active; galaxies: high-redshift; Infrared: galaxies 



1. INTRODUCTION 

The active galactic nucleus (AGN) phenomenon in 
galaxies is currently the focus of a substantial amount 
of attention, d ue to its role in altering the path of galaxy 
evolution (e.g . .lGranato et al.ll2004l iSpringel et alJl2005b 
Croton et all 120061: iHopkins et all 120061 : iBower et al.1 
20061: iSomerville et all 120081 ). The quest to find these 



sources and the techniques used in identifying them are 
constantly shaping our understanding of the underly- 
ing physics of AGN, which are currently believed to be 
the result of mass accretion onto super m assive black 
holes at the very c entre of galaxies (e.g., lLaorl Il999l 
iMelia fc Falckd2001| ). To identify AGN, a miriad of tech- 
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niques have been developed, spanning esse ntially the full 
electr omagneti c spectrum (e.g., X -rays, iSzokolv et al.1 
120041 : optical, Richard s et al.1 l2002t in f rared, see refer- 
ences below; radio. ITielens et al.l 119791: IChambers et al.l 
Il996h . 

A wide variety of both intrinsic and observational ef- 
fects, make the identification of large, statistically ro- 
bust, but also complete and reliable, populations of AGN 
galaxies challenging. Such effects encompass obscura- 
tion, accretion rate, variability, host galaxy light, viewing 
angle, and more. Different aspects of AGN radiation may 
be observed across the entire electromagnetic spectrum 
(from direct light at high energies to s ynchrotron radi- 
ation dominating at radio frequencies. iTreister fc Urrvl 
l20ll . It is common, though, to observe and analyze 
aspects of AGN activity limited to only one spectral 
regime. This has been verified in extreme radio sources 
( Afon so et alj|2006t iSimpson et al.ll2005 iMorganti et~aH 
201lHNorris et al.1120111) and in some X-ray sources (e.g., 
Loewenstein et al.l 120011 IHo et al.l 120031: iMaiolino etaLl 



20031 : iPolletta et al.l 120061 1. These specific sources, show 



an ultraviolet-to-infrared spectrum with no evidence for 
AGN activity, and some of these AGN may not even 
present any counterpart at those wavelengths (as a re- 
sult of, for instance, extreme obscuration). Even more 
generally, there is strong evidence for a large popula- 
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tion of obscured AGN, which stil l remain unidentified 
at high-energy spectral-bands (e.g-JComastri et"afl | 2001 
Ueda et all 12003: [Gmi1[200l IWorslev et all 12004 p005 
Treister fc Urrvl 120051: iMartmez-Sansigre et"aiT 12005 



Draper fc Ballantvnel 12009) . If large dust column densi 



ties are the driver for such extreme obscuration (instead 
of just gas), then infrared (IR) wavelengths, at which the 
obscuring dust emits, are an obvious choice in the search 
for such extreme objects. 

Base d on data from the Infrare d Astronomical Satellite 
(IR AS. iNeueebauer et all Il98l. Ide Griip et all ([19851 ) 
and iMilev et al.l (|1985| ) realized that simple spectral- 
index or color-color diagnostics could be efficient ways 
to select Seyfert- and QSO- type galaxies. Mor e recently, 
with the launch of Spitzer (| Werner et al.|[2003 ). the same 
techniques were applied at the shorter wavele ngths (3.6 — 
8 /xm ) of the Infrared Array Camera (IRAC . [Fazio et all 
20041). wh e re ho tt er dust temperatures are p robed. 
Lacv et all (l2004f) . lHatziminaoglou et al.l i|2l)l )■")). and 



Stern et al.l (|2005l ) proposed different IRAC colo r-color 
plots to select AGN (see also iSaiina et al.l 120051) . while 
lAlonso-Herrero et al.1 (|2006D and iDonlev et all (|2007l ) 
proposed spectral-index diagnostics to select sources re- 
vealing power-law (/ oc v a ) spec tral energy distributions 
(SEDs) characte r istic of AGN rtNeugebauer et al.l 119791: 
lElvis et al.lll994l: llvezic et al.l l2002f ) . Later, by compar- 
ison of the optical spectral regime with IR wave-bands, 
other techniques were pr oposed. Either through SED fit- 
ting (j.Daddi et al. 2007) or extreme R — [24] color cuts 
at different 24 /an fluxes dPolletta et alJl200a iDev et al.1 
120081: iFiore et al.l 120081: IDonlev et al.ll2010D . the selected 
galaxies are too red and bright for a normal star-forming 
SED, and hence indicative of AGN activity. 

However, some critical shortcomings have been 
highlighted specially regarding t he IRAC colo r -color 
AGN selecti o n tec h nique (e.g., IBarmby et al.l 120061: 
Donlev et all 120081: lEckart et all 120101: IDonlev et al.l 
20121) . Having been developed based on the shal- 
low first generation IRAC surveys, these techniques 
fail in deeper IRAC surveys, where higher redshift 
non-AGN sources pollute the adopted selection criteria 
(IBarmby et al.1 120061 : iPapovich et all 120061: IDonlev et al.1 
120081: iMessias et al.l 120101: IDonlev et all I2012TT Also, 
this technique is strongly biased aga i nst selecting low- 
luminosity AGN dTreister et al.l 120061: ICardamone et al 
20081: IDonlev et al.ll2008t lEckart et aLll2010HPetric et al 
20111 : IDonlev et al.1 12012T ) , an evolutionary phase where 
AGN are believed t o spend most of their life span (e.g., 
Hopkins et aT]|2005l l200l iFu et al.ll2010h . Furthermore, 
while the IR selection techniques return a variety of AGN 
populations, there seems to be a bias to ward the selection 
of unobscured over obscured sources dStern et al.l l2005fc 
Donlev et ai1l2007t ICardamone et al.l 2008; Eckart_e±al] 



20101 ) . which, strikingly, is the opposite of the goal that 



drives IR AGN selection in the first place. 

More recently, alternative diagnostics hav e been pro- 
posed either tuned to spec i fic samples (e . g.. iGarn et al.l 
120091: IMessias et all l2010t IDonlev et all I2012I ). or to 
newly available filter sets (lAssef et al.ll2010f). or in com- 
bination with other te chniques d Richards et al.l 120091 : 
lEdelson fc Mal kan 2012). Such work is crucial and the 
quest for revised and/or improved color-color criteria 
should continue, as it is important to identify key filter 
sets in order to maximize survey efficiency, as well as cur- 



rent (e.g., ISalvato et al.lEHM [20TTt IHainline et al.ll2011t 
iPozzi et al.ll2012D and future SED fitting procedures aim- 
ing to break down the strong degeneracy between AGN 
and non-AGN SEDs. 

In this work, we explore new IR color selections with 
the goal to achieve improved efficiency (better complete- 
ness and less non-AGN contamination). In Section [2] 
different possibilities for the mechanisms behind the IR 
emission are discussed, and the new criteria are pre- 
sented in Section [3l A series of tests of these new 
criteria are explored in Section [4] using a broad set 
of deep multi-wavelength control samples (selected at 
wavelengths ranging from X-ray to radio). Implications 
for and expectations from James Webb Space Telescope 
(JWST) surveys are highlighted in Section [5l followed 
by the conclusions of this work in Section |6l 

Throughout this paper we use the AB magnitude sys- 
tem 7 , assuming a ACDM model with Ho = 70 km s _1 
Mpc-\ Q M = 0.3, fl A = 0.7. 

2. DISTINGUISHING AGN FROM NON-AGN IR 
CONTRIBUTIONS 

The SEDs of stellar and/or star-formation (SF) dom- 
inated systems (henceforth referred to as normal galaxy 
SEDs) have some distinctive characteristics, allowing the 
separation of this population from AGN host galaxies 
through the use of IR colors alone. Figure Q] illustrates 
a few examples using gala xy templates taken from the 
SWIRE Template Library (|Polletta et al.ll2007h . In nor- 
mal galaxy SEDs, a dip in the 1-6 /zm rest-frame range 
is expected. The overall blackbody emission from the 
stellar population combined with the minimum in the 
opacity of the H~ ion in stellar photospheres, peaks at 
~ 1.6 Atm, which is generally observed, as is the CO ab- 
sorption at 2.35-2.5 /^m from red supergiants. Emission 
from hot dust surrounding stars in the asymptotic gi- 
ant branch (AGB) phase is expecte d in young systems 
(5, 1 Gyr) and to peak at 2 — 5 urn (jSaiina et al.l 120051 : 



lMarastoiJ2005tlHenriques et al.ll2011| ). Furthermore, the 
strength of the polycyclic aromatic hydrocarbons (PAH) 
features, seen mostly beyond 6 /Ltm, increases with star 
formation activity and metallicity (Calzetti et all 120071 : 
lEngelbracht et all 120081: Hunt et al.ll2010D . It is in this 
spectral region (1-6 /im) that the difference between nor- 
mal galaxies and AGN dominated SEDs is the greatest. 
The existence of an AGN is frequently accompanied by a 
power-law continuum (f„ oc v a , e.g.. | Neugebauer et al.l 



[19791: lEl^iTet^ll993 ' , rivezic et al.M200l " rising in flux 



beyond ~ 1 /im. This is a consequence of X-ray-to- 
optical AGN radiation being reprocessed to IR wave- 
lengths by du st surrounding the central region of an 
activ e galaxy (Sanders et al.l 119891 119991 : iPier fc Krolik I 
1992). This feature, being different from the dip ob- 
served in normal galaxy SEDs, is unique for AGN hosts 
and enables their selection through SED inspection. 

2.1. The template set 

The templates used throughout this paper come from 
published work as follows: 10 templates covering early 

7 When necessary the following relations are used: (K, 
[3.6], [4.5], [5.8], [8.0])as = (K, [3.6 ], [4.5], [5.8 ], [8.0]) VeS a 
+ (1.841, 2.79, 3.26, 3.73, 4.40) URoche et a lj [2U03I , and 
http: / /spider. ipac.caltech.edu/staff/gillian/cal. html ). 
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Figure 1. Exampl es of galaxy templat es taken from the SWIRE 
Template Library (Pollctta ct al. 2007) and flux normalized at 
1.6/^m: SO (early type galaxy, red dotted line), M82 (starburst 
galaxy, blue solid line), IRAS 19254-7245 (a hybrid source, ma- 
genta dashed line), and a type-1 QSO (AGN, black dot-dashed 
line). The shaded regions show what rest-frame wavelength the K, 
IRAC, and MIPS 24/im filters will be observing depending on the 
redshift. 

to late galaxy typ es, five starbur s ts, fou r hybrids 8 , and 
six AGN, all from IPolletta et al.l ()2007D : ni ne starburst 
Ultra- Luminous IR Galaxies (ULIRGs) from lRieke et al.l 
one blue s tarburst and 18 hybrid SEDs from 
iSalvato et al J (120091): and one extremely obscured hybrid 
from lAfonso et all (|2001f ). 

All templates have been fully characterized in the pa- 
pers which introduce them (see above). We assume that 
they sam ple adequately the color- z space, based on the 
results bv lSalvato eFatl (|2009l 1201 ID . In these works, the 
templ ates above (exce pt those from lAfonso et al.l I2001L 
and iRieke et al.l [2009) were used to compute reliable 
photometric redshift estimates (reaching an accuracy of 
oaz/(i+ Zbpoc ) ~ 0.015, where Az = \z spcc - z phot |) for 
both bright, nearby objects and faint, high- z sources (z < 
5.5). B y fu rther adopting the models from lAfonso et al] 
(|2001l ) and IRieke etail (|2009t) . the whole template set 
aims to account for a wide range of mixtures between 
star-formation and AGN emission in galaxy SEDs and 
their evolution across cosmic time, as evidenced by ob- 
servations (e.g..|Farrah et al.ll2007t iPapovich et al1 | 2007| : 
Rigbv et all 1 20081: IPope et all 120081: iFiore et aTTbooa 
Donlev et all 120101: iFu et all [20101: lElbaz et aLp201lD . 
This is a valid assumption given that local templates are 
successful in fitting some of the most extreme high red- 
shift sources (for instance, the case of Arp220 as a local 
analog of HR10, an extre mely red galaxy a t z = 1.44, 
IHu h Ridgwavl Il99l and lElbaz et al.l [20021 or M82 as 
an analo g for star-format ion dominated sub-millimctcr 
galaxies, IPope et aLll2008|) . 

Five ULIRGs (IRAS sources 12112+0305, 14348-1447, 
17208-0018, 20551-4250, and 22491-1808) which were 
previously considered as pure-starbursts in the litera- 
ture, are labelled as hybrid sources in this work. This 

8 By hybrids we refer to SEDs simultaneously including stel- 
lar/SF and AGN emission. 



is based o n the findings by Veilleux et al.l (|2009D and the 
review bv lSani fc Nardinj ( 20111 ). where IR spectroscopy 
is shown to reveal significant AGN contribution (6-30% 
at 8-1000 /zm, 8-40% at 5-8 /mi) in these ULIRGs. 

Table Q] lists the adopted template set. The SED 
templates are organized in four groups: (a) Early to 
Late-type galaxies, (b) Starbursts, (c) Hybrids, and (d) 
AGN. The following investigation will focus on how these 
groups populate near-to-mid IR color-color spaces, aim- 
ing to separate the AGN/Hybrid population, (c) and (d) 
above, from normal galaxies, i.e., (a) and (b). 



3. THE NEW APPROACH 
3.1. An enhanced wedge diagram: the KI criterion 

In Figure [2] the color tracks (spanning the range < 
z < 7) for the template SEDs considered are presented on 
the L07 (left plot) and S05 (right plot) crite ria color-color 
spaces . The criterion recently proposed bv lDonlev et al.l 
(|2012t D12 henceforward) is also shown in the L07 color- 
color space, with the caveat that the D12 criterion re- 
quires a monotonically increasing red SED in addition to 
the color cuts shown in the figure. These criteria consider 
the 3.6-8.0 /mi observed-frame (the IRAC-frame) to se- 
lect AGN candidates. This is the spectral range where 
AGN-dominated SEDs are better distinguishable from 
normal galaxy SEDs at rest-frame wavelengths, meaning 
that redshift will play a role in the selection efficiency. 
In both color-color spaces, the nominal AGN regions en- 
compass most of the AGN and hybrid tracks for a large 
range of redshifts, as they were built to do (the D12 cri- 
terion focus only on the most SED-dominant AGN). 

We note, however, that the use of a very diverse 
SED template set already shows some shortcomings of 
these diagnostic plots. In both plots, the two upper 
panels (early/late and starburst galaxies) show a sig- 
nificant contamination of the nominal AGN region of 
L07 and S05 (and D12 to a smaller degree) by nor- 
mal (non-AGN) galaxies. Each criterion is differently 
affected at different redshifts. Even when ignoring pho- 
tometric scatter (shaded regions in the figures), one ob- 
serves non-AGN contamination in at least one crite- 
rion throughout the whole < z < 7 range. This 
has been noted by p revious studies testing the L07, 
S05. a nd D12 criteria (iBarmbv et aTll2006l: IDonlev et al.l 
l2008t iCardamone et al.l 120081: IDonlev et al.l I2012D . but 
it is also found fo r other IR techn i ques at higher red- 
shifts (z > 2: e.g. IPope et al.l 120081: IDonlev et all [200l 
[20Tot iFadda et al.l 120101 : iNaravanan et al.1 12010ft . While 
at low- redshifts (z < 2 — 3), the contamination is due 
to strong PAH emission as well as extreme obscuration 
differently affecting each IRAC channel, at higher red- 
hifts (z > 2 — 3), the contamination is mostly driven 
by the limitation in observed wavelengths (the IRAC 
frame, 3.6 < A [/.mi] < 8). At these redshifts, IRAC 
bands mostly probe the rest-frame A < 1.6 /mi, where 
obscured star-forming systems (frequently identified at 
those redshifts) present red colors mimicking those of 
AGN. The clear implication is that any AGN criteria 
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Figure 2. Model color tracks displayed in the L07 (left plot) and S05 (right plot) criterion color-color space. Long-dashed blue lines 
correspond to the boundaries pro posed in those works for the selection of AGN, while short-dashed blue lines in the left plot correspond to 
the color boundaries proposed by Donley ct al. (2012). Each panel presents a specific group: (a) Early/Late, (b) starburst, (c) Hybrid and 
(d) AGN. The dotted portion of the tracks refers to the < z < 1 rcdshift range, and the solid to 1 < z < 7. Red circles along the lines 
mark z = 2.5. The light and dark-grey regions show the photometric scatter due to a magnit ude error of, respec tively, 0.1 and 0.2 in the 
considered bands (equivalent to ~ 10% and ~ 20% error in flux, respectively). Dwarf stars (Patten ct al. 200j|) are shown for reference: 
M-dwarfs appear as open cyan squares, L-dwarfs as open green circles, and T-dwarfs as open magenta triangles to show where these red 
point-like cool stars appear. 



restricted to the IRAC-frame will never be able to dis- 
entangle AGN from non-AGN populations at z > 3. 
In Figure [2j the highest redshift where non-AGN tem- 
plates are restricted to lying outside the S05 and D12 
selecting regions is z ~ 2.5. At higher redshifts, L07, 
S05, and D12 are all contaminated to the same degree. 
At z < 2.5, the D12 criteria is the least contaminated 
(as shown in lDonlev et a l. 2012), but being restricted to 
such a narrow color-color space, it will miss a signifi- 
cant number of low- and intermediate-luminosity AGN 
(log(Lx[ergs -1 ]) < 44). In addition, cool dwarf stars 
may fall inside or close to the L07 and S05 regions, thus 
being potential (point-like) contaminants. 

In order to enhance these wedge diagrams, one can 
extend the wavelength coverage to different wavebands. 
This is obviously outside the IRAC framework behind 
the original definition of such wedge diagrams, but suits 
the larger JWST wavelength coverage. For instance, by 
considering shorter wavelengths ( < 3 fim) , one is probing 
a spectral region mostly dominated by stellar emission 9 
(see Figure [T]). Such a scenario is an advantage as we 
now compare a stellar dominated wave-band with one 
at longer wavelengths (e.g., the IRAC-frame) that has 
contribution either from stellar or AGN light. Such a 
comparison will yield a large color dispersion, which is 
ideal for the separation of the two types of systems. 

A particularly relevant combination of colors is K — 
[4.5] versus [4.5] - [8.0] (Figure 13}. This if+IRAC com- 
bination, henceforth called the KI criterion, is defined by 
the following simple conditions: 

K - [4.5] > 

9 However, AGN emission m ay also dom i nate at such short IR 
wavelengths. See. for example, Assef et al. (2011), Hainlinc ct al. 
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Figure 3. The proposed KI criterion. Symbols and panel defini- 
tion as in Figure [2] 



and 



[4.5] - [8.0] > 



These color cuts result from a balance between reject- 
ing the non-AGN 10-20% photometric scatter regions 
at z < 2.5 and encompassing the pure- AGN color tracks 
and respective photometric scatter regions. Using a wide 
variety of templates and taking into account photometric 
errors we aim to limit possible selection effects resulting 
from wedges created a posteriori, for example, by using 
a sample of AGN spectroscopically confirmed such as in 
L07 and S05. 

Similar to the L07, S05, and D12 criteria, the normal 
galaxy color tracks contaminate the KI selection region 
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Table 1 






List of templates. 




Class 


template 


Ref a 


Early/Late 


E1113 
E112 
EU5 

SO 

Sa 

Sb 

Sc 

Sd 
Sdm 
Spi4 


[2] 

11 

11 
11 
11 


Starburst 


M82 
Arp ZZ(J 
NGC O090 

t~i c 1 /"\ noon „non 

rLbO Uoz(J-g(Jo(J 

unp 1 1 A 

lb 14 
in ^_tv_ zooy 

In OZOD 

Li w u^±y . uo / 


[2J 
r -1 1 

Hi 

55 
„ 




PR1 T OTTTA 


W 


u u -a 

riy brio 


TR A °> 1 91 1 

irirt-D iziiz-t~UoUO 


\a til 








tt~> A o i Tono rim o 

1HAS 17208-0018 






T T~> AO no i n 1 i o n o 

1HAS 22491-1808 


[2,5] 




IRAS 20551-4250 


[2,6] 




IRAS 19254-7245 




NGC 6240 






Seyfert 2.0 






Seyfert 1.8 






ERO 


[1] 




plJ22491_10_TQSOl_90 


[3] 




plJ22491_20_TQSOl_80 






plJ22491_30_TQSOl_70 






I22491_40_TQSOl_60 






I22491_50_TQSO 1_50 






I22491_60_TQSOl_40 






I22491_70_TQSOl_30 






I22491_80_TQSOl_20 






I22491_90_TQSOl_10 






b0_lU_QSO2_90 






S0_20_QSO2_80 






b0_30_QbO2_70 






on /in /~\Oz~\n c n 

b0_4U_QSO2_b0 






on p"n Acni fn 

b0_50_QbO2_50 






on /"n /~\o/~\n /in 

b0_6U_QSO2_40 






S0_70_QSO2_30 






S0.80.QSO2.20 






S0.90.QSO2.10 




AGN 


Mrk 231 


[2j 




BQSOl 




QSOl 






TQSOl 






QS02 






Torus 





a Reference work from which the template was 
retrieved , and , if present , reference work re- 
vealing the presence of hidden or non-dominant 
AGN (right column) : [1] Afonso ct al. (2001), [2] 
Pollctta ct al. (2007), [3] Salvato ct al. (2009), [4] 
Rieke et al.| i2009f), [5 ] IVeilleux eTall [6] 
fc Nardinil 1120111 1. 

eux et all ]2009l) do not present results 
for IR AS 17208-0018 specifically, but lRieke et all 
J2009I . the work where that template comes from) 
use Spitzer- IRS spectroscopy on IRAS 22491-1808 
(studied by IVeilleux et al .1120091) to complete ISO 
observations on IRAS 17208-0018, invoking sim- 
ilarities between the two in the 5— ll^im range, 
where the analysis by [Vcillcux ct al. ( 2009) is 
done. 
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Figure 4. The effects of considering prominent lines in type-1 
QSO SED models. The small in set shows three such SEDs orig- 
inally from Pollctta ct al. (2007): two updated versions of QSOl 
(dashed blue line, now with an Ha line) and BQSOl (red dot- 
ted, now with both Ha and OIII lines), and the TQSOl template 
(solid black, which already includes Ha and OIII lines). On the 
main panel the original SED model color tracks are shown as solid 
lines (from top to bottom: TQSOl, QSOl, and BQSOl), whereas 
the inclusion of strong emission lines produces the deviations given 
by the dotted segments. Circles and triangles show 2 = 1 and 
2 = 3, respectively. The tracks extend from z = to z = 7. 

at z > 2.5. However, the KI contamination by normal 
galaxies at z < 2.5 appears significantly reduced in com- 
parison to L07 and S05, with no negative effect on the 
ability to select AGN (i.e., completeness). By avoiding 
the 10-20% error scatter region (light- and dark-grey re- 
gions) from starburst templates, the KI boundaries have 
improved efficiency on faint source classification. An- 
other conceptual improvement of KI is the unbounded 
upper right AGN region. This avoids the loss of heav- 
ily obscured AGN (with extremely red colors) . This is in 
contrast to the wedges defined in S05, for example, where 
the Torus template moves out of the selection region at 
the highest redshifts (z > 4). 

One can also note the usefulness of the simple K — 
[4.5] color in excluding low redshift normal galaxies: the 
condition K — [4.5] > is able to reject a large fraction 
of the z < 1 non-AGN galaxies. Such a property also 
makes this simple color-cut of great use for the study of 
AGN and star-formation co-evolution in the latter half 
of cosmic history. 

Not all the templates in Table [1] take into account 
prominent emission lines. These may affect the photom- 
etry and produce some degree of scatter in the color- 
color tracks. This is visible in Figure |4] where updated 
versions (highlighte d as dotted lines) o f the QSOl and 
BQSOl templates (|Polletta et all 120071) are considered, 
now with both Ha a nd OIII lines includ ed (visible in 
the TQSOl template. iPolletta et al.l,2007[ ). Although in 
specific redshift intervals (when a certain emission line is 
rcdshifted into a given filter), AGN sources with smaller 
AGN contribution in the IR (like BQSOl) fall out of the 
AGN region of the KI criterion, the bulk of the AGN 
population is expected to remain inside the KI bound- 
aries. 
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3.2. Extending to high redshifts: the KIM criterion 

One serious problem for all criteria investigated so far 
is the contamination by normal galaxies at z > 2.5. All 
four (L07, S05, D12, and KI) fail to disentangle AGN 
dominated systems from normal galaxies at those red- 
shifts. To avoid this problem, longer wavelength ob- 
served bands should be considered, allowing the obser- 
vation of the rest-frame A > 1.6 ^m regime, where AGN 
emission may dominate, at high redshifts. For this pur- 
pose, we exte nd our study to in clude the MIPS-24/im 
band (see also lSaiina et al.ll2005D . 

The use of this waveband for AGN selection has ad- 
ditional shortcomings. Normal galaxies show a wide 
IRAC-MIPS color range (as a result of different PAH 
and dust emissions between galaxies), which is further 
increased by redshift (the PAH features are redshifted 
out of the 8.0 /xm band at low-z, but redshifted into 
the 24 /im band at z ~ 2). This results in a con- 
siderable color overlap with AGNs, limiting the useful- 
ness of a single IRAC-MIPS color to separate both pop- 
ulations (lLacv et al.1 l2004t lHatziminaoglou et al.l 120051 : 
ICardamone et al. 2008). Nevertheless, some authors 
have used the MIPS-24 fim band for unique, extreme 
objects (e.g., the IR-excess technique, Section 14.21) or 
in single, unconve ntional situations. For instance, while 
iGarn et all (|200l use [8.0] - [24] against [5.8] - [8.0] for 
a z ~ 0. 8 sample to identify t hose sources showin g AGN 
activity, iTreister et all (|2006| ) and iMessias et all (|2010l ) 
use a single [8.0] — [24] color cut at, respectively, z ~ 2 
and z > 2.5 for the same purpose 10 . 

Colors involving the 24 /im band are also usually 
avoided due to the large wavelength gap between this 
band and other commonly available MIR bands (usu- 
ally the Spitzer-IRAC bands). At high-z (e.g., z ~ 3), 
however, the sampled rest-frame wavebands (2 and 6 /im 
corresponding to observed 8 and 24 /im, respectively) are 
not much more separated than the 3.6 and 8.0 /im IRAC 
bands for nearby galaxies. A different issue is the lower 
sensitivity and larger point spread function of the MIPS 
24 /im images when compared with those obtained with 
the IRAC channels, which affects the accuracy of color 
measurements using this longer wavelength band. Never- 
theless, efficient photometry algorith ms have been built 
to overcome these b lending problems (|Wuvts et al.ll2008l : 
iSantini et aLll2009D . 

Figure [5] illustrates a color-color AGN diagnostic pro- 
posed for z > 1, with its boundaries avoiding a 10% 
photometric error scatter from normal galaxies. The cri- 
terion is defined by the following IRAC-MIPS (IM) con- 
ditions: 



and 



.0] - [24] > -2.9 x ([4.5] - [8.0]) + 2. 
[8.0] - [24] > 0.5 



One can see that beyond z ~ 1, AGN (lower panels) 
and normal galaxies (upper panels) occupy essentially 
different regions in the [8.0]- [24] versus [4.5]- [8.0] space. 
This is of great interest for the characterization of high 




10 IIvison etTaTI puM l and lPope etTaTI lf200l 1 also explore [8.0] - 
[24] against [4.5] — [8.0] to distinguish AGN from normal galaxies, 
but, in those works, only the [4.5] — [8.0] color is effectively used 
for that purpose. 



2-1 
[4.5]-[8.0] (AB) 

Figure 5. The IRAC-MIPS color-color space and the proposed 
criterion. Symbols and panel definition as in Figure [2] The dot- 
dashed horizontal blue line (at [8.0] — [24] = 1) refers to a simpler 
criterion valid at z > 3, as detailed in the text and illustrated in 
Figure [6] 



redshift galaxy populations, such as L yman Break Galax- 
ies (LBGs) and equivalents a t z > 2 (Steidcl et al.1120031 
12004 Adclbcr ger et al.l 120041 ) . or spectroscopically con- 
firmed high-z galaxies. We note, nevertheless, the incom- 
pleteness toward type-1 QSOs at z > 4 — 5 11 or type-2 
QSOs with strong H a emission affecting the 4.5 /im band 
at z > 4.5 (Figure ID). Note, however, that Mrk231 and 
torus-type AGN remain inside the region for the full red- 
shift range studied (0 < z < 7). 

At low redshifts (z < 1), degeneracy exists in this IM 
color-color space, with AGN and normal galaxies occu- 
pying the same color-color region. A rejection of low- 
rcdshift (z < 1) normal/star forming galaxies would, 
however, remove this overlap, allowing for a power- 
ful AGN-selection criterion to be built. This can be 
achieved, as noted in the previous section, by using the 
K — [4.5] > color cut (Figure[3]). This allows the rejec- 
tion of a large fraction of the z < 1 non-AGN galaxies, 
with AGN and hybrid galaxies in this redshift range re- 
maining mostly unaffected. 

The IRAC-MIPS (IM) conditions presented above, 
when considered together with the K — [4.5] > 
cut, which implements the rejection of z < 1 normal 
galaxies, define what we will henceforth call the KIM 
(K+IRAC+MIPS) criterion. 

We further note from Figure [5] that for z > 3, essen- 
tially all SEDs with [8.0] - [24] > 1 are dominated by 
AGN emission. This happens because in this redshift 
range, the PAH features dominating a normal galaxy 
SED are redshifted out of the MIPS 2 4 Aim filter. Figure M 
details this behavior, clearly showing that stellar dom- 
inated galaxies at z > 3 show [8.0] — [24] < 1 colors, 
as described by IMessias et a l. (2010). Such a color cut, 
if possible to apply (one has to know the source red- 
shift), allows the recovery of Ha-emitting and/or rest- 
frame UV/optical blue QSOs expected to leave the IM 

11 However, type-1 QSOs with the largest optical-to-IR flux ra- 
tios migrate back in the AGN region at z > 6. 
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Figure 6. The [8.0]-[24] color evolution with redshift. Panel def- 
inition as in Figure [2] The horizontal line shows [8.0] — [24] = 1, 
while the vertical one indicates 2 = 3. At z > 3, only AGN domi- 
nated galaxies show [8.0] — [24] > 1 colors. 

region at z > 4 — 5 (Figure [5} . 

We acknowledge that by aiming at such a broad red- 
shift range (0 < z < 7) KIM has its shortcomings. For 
instance, if the goal is to characterize sources at z < 1, 
the use of K — [4.5] allows a reliable selection of AGN- 
hosts avoiding the need for longer wavelength bands. On 
the other hand, at z > 3, the [8.0] — [24] color is enough 
to identify AGN dominated sources, avoiding the loss 
of sources undetected at shorter wavelengths. Finally, 
the [4.5] — [8.0] colour is best at intermediate redshifts 
(1 <z < 2.5). 

However, a redshift estimate is frequently not avail- 
able (incomplete spectroscopic sample coverage) nor re- 
liable (photometric redshifts). Particularly for AGN- 
dominated sources, where a power-law SED is observed, 
the lack of spectral features (such as spectrum breaks) 
makes the photometric redshift solutions degenerate. In 
order to avoid such limitations, while still allowing AGN 
selection up to z ~ 7, the KIM criterion may be valuable. 

4. TESTING AGAINST MULTI- WAVELENGTH AGN 
SAMPLES 

In the previous section we have proposed: 

- A' — [4.5] as an useful color for the efficient segrega- 
tion of the galaxy population into AGN-dominated 
and normal SEDs at z < 1; 

- KI criterion as an alternative to L07, S05, and D12; 
and 

- KIM (a 4 band, 3 color criterion), as a diagnostic 
which, according to the color tracks of the tem- 
plates used, enables the selection of AGN sources 
at < z < 7 with little contamination by normal 
galaxies. 

This is of great interest as it allows one to trace AGN 
activity since the epoch of reionization to the current 
time. The usefulness of these criteria can only be eval- 
uated, however, by pursuing a test with well character- 
ized control samples. Using various samples for which 



normal-galaxy and AGN classifications are obtained via 
other spectral regimes (from X-rays to radio), we can ob- 
tain some estimate of the efficiency of the new proposed 
diagnostics in comparison with commonly used ones. 

Because an ideal sample, where all the sources are 
reliably split between AGN and non-AGN hosts, does 
not exist, we will perform these tests using five con- 
trol samples. First, we use a sample o f galaxies from 
the C handra Deep Field South (CDFS, iGiacconi et al.l 
120011 ) and a nother from the Co smic Evolution Survey 
(COSMOS, IScoville et all [2003), both with available 
AGN/non-AGN classification from X-rays and/or opti- 
cal spectroscopy. Sec ond, we assemble samples of IR- 
excess (IRxs) sourc es (jDev et al.H200l : iFiore et al.ll200l 
IPolletta et al1l200i found in CDFS and COSMOS fields. 
The Q SO sample from the Sloan Digital Sky Survey 
(SPSS. [Schneider et alJl2010T ). reaching z ~ 6, is also 
considered for the testing, as well as the Hi gh- z Radio 
Galaxy (HzRG) sample from lSevmour et all §007). The 
first two samples (X-ray/optical selected in CDFS and 
COSMOS) allow for an indication of the completeness 
and non-AGN contamination of the IR AGN selection 
criteria, while the AGN samples (IRxs sources, SDSS 
QSOs and HzRGs) will allow for independent measures 
of their completeness up to the highest redshifts. 

In the following subsections, Completeness (C) is de- 
fined as the fraction of the AGN population which a given 
IR criterion selects (AGNsbl/AGNtot), while Reliabil- 
ity (1Z) refers to the fraction of the IR sources selected by 
a given criterion which are "true" AGN (AGNsel/Nsel, 
where N S el = AGN S el + non-AGN SEL )- 

4.1. The CDFS and COSMOS samples 

We ha ve selected 2287 gal a xies from MUSIC /C DFS 
catalog (lGrazian et al.l 120061: iSantini et al.l |2009() and 
7217 from COSMOS (|Ihb^riet_alJ |2009|) with an X-ray 
classification (based on source luminosity and spectral 
slope) and/or a good quality 12 optical spectroscopic clas- 
sification. Whenever a spectroscopic redshift was not 
avai lable, the photometr ic estimates bv lLuo et al.l ()2010l ) 
and lSalvato et alJ (|2011( ) were adopted. 

The IR data u sed for the MUSIC catalog comes from 
iVandamiel (|2002f ) and Dickinson et al. (in prep . ), an d 
that for COSMOS co mes fr om iSanders et all (2007), 
iLe Floc'h etaU pOOl ). and iMcCracken eTafl $20W) . 
Regarding the X-rays, the 2 M s Chandra Deep Field 
South (CDF-S. Iluo et al.l[200l data was used, as well 
as the XMM data in COSMOS (jCannelluti et all 12003 
iBrusa et al1l2010D. The X-ray AGN classification is sim- 
ilar to that of iSzokolv et all (|2004l ). There, the X-ray 
luminosity and hardness-ratio (HR) are used to iden- 
tify the AGN population. The HR is a measure of the 
source obscuration and is defined as HR=(H-S)/(H+S) 
with H and S being, respectively, the net counts in 
the hard, 2-8 keV, and soft, 0.5-2 keV, X-ray bands. 
However, this ratio beco mes degenerated wi th redshift 
(lEckart et alJ 120061 an d iMessias et all 120101 but also 
lAlexander et al.l 120051 and ILuo et al. and count 

measurements depend on array efficiency between tele- 
scopes. Hence we compute for each source the respec- 

12 Spectra flagged as (very good) or 1 (good) in the MUSIC 
catalog, and estimates with > 90% probability of being correct in 
the ^COSMOS catalog. 



tive column densities (Nh) using the Portable, Interac- 
tive Multi-Mission Simulator 13 (PIMMS, version 3.9k). 
The soft-band/full-band (SB/FB) and hard-band/full- 
band (HB/FB) flux ratios 14 were estimated for a range of 
column densities (19 < log(NH[cm~ 2 ]) < 25, with steps 
of log(N H [cm- 2 ]) = 0.01), and redshifts (0 < z < 7, with 
steps of z = 0.01), consid ering a fixed photon index, 
r = 1.8 (|Tozzi et al.ll2006D . The comparison with the 
observed values results in the estimate of Njj. However, 
it is known that this procedure systematically overesti- 
mates the intrinsic Nh- The Appendix discusses this bias 
and how we attempted to correct it. The final Nh value 
is then used to derive an intrinsic X-ray lu minosity. The 
HR constraint used bv lSzokolv et all (|2004D (HR = -0.2) 
is equivalent to log(NH[cm~ 2 ]) = 22 at z ~ 0, and this is 
the value considered throughout the whole redshift range. 

Hence, an X-ray AGN is considered to have L^?' > 
lO^ergs" 1 and N H > 10 22 cm" 2 , or Llf > 10 42 ergs _1 . 
The remaining X-ray detections are regarded as non- 
AGN sources. The intrinsic X-ray (0.5-10 keV) lumi- 
nosities are estimated as: 

L^ = 47rd2f 1 f t (l + z) r - 2 ergs- 1 

where f^* is the obscuration-corrected X-ray flux in the 
0.5-10 keV band, and T is the observed photon in- 
dex (when log(N H [cm" 2 ]) < 19) or V = 1.8 (when 
log(N H [cm- 2 ]) > 19). In CDFS, th e 0.5- 8 keV lumi- 
nosities are derived using ILuo et all (|2008P ) catalogued 
0.5-8 keV fluxes and converted to 0.5-10 keV consider- 
ing the adopted T. For simplicity, the luminosity 'int' 
label is dropped from now on, as we will always be refer- 
ring to intrinsic luminosities, unless otherwise stated. 

Regarding the spectroscopic sample, the AGN sources 
are those which display broad line features or high- 
ionization narrow emission lines characteristic of AGN 
activity (BLAGN or NLAGN, respectively). The re- 
maining sources with a spectroscopic classification are 
regarded as part of the non-AGN population (e.g., SF 
galaxies, stars). The NLAGN classific ation comes from 
the M USIC catalog in CDFS, and from iBongiorno et al.l 
pMoft in COSMOS. 

In both the CDFS and COSMOS final AGN samples, 
most sources have an X-ray AGN classification (82% and 
80%, respectively), and a significant fraction has a spec- 
troscopic AGN classification (21% in CDFS and 55% in 
COSMOS, the difference resulting from different spectro- 
scopic completenesses between the two surveys). 

4.1.1. KI/KIM efficiency in CDFS 

For consistency, we only consider sources having pho- 
tometry estimates with a flux error smaller than a third 
of the flux measurement (equivalent to an error in magni- 
tude smaller than 0.36) in all i^-IRAC bands when test- 
ing L07, S05, D12, and KI. This requirement will remove 
many of the fainter objects, but the final sample is still 
among the deepest ever used to test these IR criteria. 
The magnitude distribution of the final sample consid- 
ered is shown in Figure [7] (see open histograms). Among 

13 http:/ /heasarc. nasa.gov/docs/software/tools/pimms. html 

14 The use of ratios based on FB flux instead of the commonly 
used SB/HB flux ratios, allows for an estimate of Nh when the 
source is detected in the FB but no detection is achieved in either 
the SB or HB. 



flux [M J y] 

1000 100 10 1 1000 100 10 1 




16 18 20 22 24 16 18 20 22 24 
mag [AB] 



Figure 7. The magnitude distribution of the final CDFS sample 
with reliable photometry in K-IRAC bands (open histogram) and 
K-IRAC-MIPS24(jm (hatched histogram). Each panel refers to the 
magnitude distribution in the following bands: K B (upper left), 
4.5 (im (upper right), 8.0 /im (lower left), and 24 /im (lower right). 
Note that in the latter both histograms coincide. 

the 1440 sources composing the final sample, 166 (12%) 
are classified as AGN hosts (160 in X-rays, 38 through 
spectroscopy). The sample is further separated into red- 
shift ranges: < z < I, l< z < 2.5, 2.5 < z < 4. The 
adopted threshold at z = 2.5 is based on the discussion 
in section |3~T1 This results in 801, 535, and 94 sources 
with i^s-IRAC photometry at0<z<l,l<z<2.5, 
and 2.5 < z < 4, respectively. 

When testing KIM we also require reliable 24 /im pho- 
tometry (hatched histograms in Figure[7]). However, this 
requirement restricts the sample to the brightest sources, 
unavoidably increasi ng the probability o f finding AGN 
dominated sources ([Brand et al.l 120061 : iTreister et al.l 
120061 : iDonlev et afl |2008|). Although it can be consid- 
ered a constraint by itself (which improves the criterion 
efficiency), this results in an unfair comparison of the ac- 
tual color-color regions. Hence, when comparing KIM to 
L07, S05, and KI, we consider the sample of 834 sources 
(460/324/47 at < z < 1, 1 <z < 2.5, and 2.5 < z < 4, 
respectively) with reliable if s -IRAC-MIPS24 Atm photom- 
etry, of which 139 (17%) are classified as AGN hosts. 

Tables [2 [3l and 0] summarize the final statistics for the 
application of each of the IR criteria to the CDFS con- 
trol sample at different redshift ranges. In order to as- 
sess the significance of the results, two types of error are 
presented: Poisson and photometric. The latter is com- 
puted by scattering the source colors between the ±lcr 
magnitude errors (which we limited to < 1/3 of the flux 
value) and evaluate how completeness (C) and reliability 
(1Z) vary as a result. However, there is a covariance be- 
tween observational photometric and Poisson sampling 
errors, since the samples with the smallest photomet- 
ric errors are the smallest in number, leading in turn 
to large sampling errors. Since our approach is based 
around testing colour-space boundaries that have already 
accounted for generous photometric variation around the 
SED templates in their definition, the current analysis is 
primarily limited by sampling (Poisson) errors. In gen- 
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Table 2 

CDFS X-ray and Spectroscopic < z < 1 control sample test. 



Sample Criterion N SBL a AGN b C c Tl c 

i\"+IRAC [none] 801 42 ... (5±1) 

L07 105 21 50(13,|) 20(5,|) 

S05 26 12 29(9, |) 46(16,^) 

D12 5 4 10(5,|) 80(54,| ) 

KI 24 12 29(9,°,) 50(18,^ 3 ) 

K +1RAC+ [none] 460 37 ~ (8±1) 

MIPS 2 4 Alm L07 76 18 49(14,g) 24(6,?) 

S05 20 11 30(10,|) 55(21, f 6 ) 

D12 5 4 ll(6,g) 80(54,P, ) 

KI 17 10 27(10,8) 59(23,|) 

KIM 10 7 19(8,3) 70 ( 34 >2o) 



Table 4 

CDFS X-ray and Spectroscopic 2.5 < z < 4 control sample test. 



Sample 


Criterion 


N SEL 


AGN 


C 


n 


K+IRAC 


[none] 


94 


40 




(43±8) 




L07 


93 


40 


100(22,°,) 


43(8,|) 




S05 


54 


29 


72(18,1°) 


54(12, 2 |) 




D12 


29 


20 


50(14,18) 


69(20,2?) 




KI 


73 


36 


90(21,3) 


49(10,|) 


K+IRAC+ 


[none] 


47 


33 




(70±16) 


MIPS 2 4 M m 


L07 


47 


33 


100(25,0) 


70(16,2) 




S05 


32 


24 


73(20,? 5 ) 


75(20, 7 4 ) 




D12 


20 


17 


52(15,1*) 


85(28,1?) 




KI 


41 


30 


91(23,°) 


73(18,*) 




KIM 


24 


20 


61(17,i 2 ) 


83(25,? 5 ) 



Note. — This table is restricted to the 2.5 < z < 4 CDFS sample. 
Table structure and column definition as in Table [2] 

complete (selecting almost 50% of the AGN sample), but 
the least reliable as well, with less than a fifth of the sam- 
ple being confirmed AGN hosts. This is understood given 
the large selection region set by the color constraints of 
L07 criterion, which despite selecting many AGN hosts 
(high C), also selects many non-AGN sources (low 1Z). 
S05, KI, and KIM present almost comparable efficiencies, 
selecting around a quarter of the AGN population, and 
with 50-60% reliabilities. D12 retrieves the lowest C val- 
ues (10%, comparable with KIM, see also Section 14.1.21) . 
but at the same time the highest 1Z values (80%, although 
with large Poisson errors) . 

At 1 < z < 2.5, L07 is still the criterion that provides 
the most complete AGN sample, but here it presents an 
7Z value of ~ 30%, comparable to that of S05. Both 
KI and KIM present improved 1Z levels, with 50-60% 
of the sample being confirmed as AGN hosts. The im- 
provement relative to the L07 and S05 criteria in this 
rcdshift range appears to be due to the [4.5] — [8.0] color 
cut used. The D12 criterion is again the criterion that 
presents the lowest C (20%) and the highest 1Z (80-90%). 
The D12 criterion is affected by small sample statistics, 
but it always recovers the highest 1Z values, supporting 
the scenario that it is in fact the most reliable of all the 
tested criteria. 

At high-z (2.5 < z < 4), the fraction of "true" AGN 
hosts is already high (43%, increasing to 70% when re- 
stricting to the MIPS24 M m detected sample). As a result 
of the small number of sources (driven by the necessary 
quality constraints) , no conclusion can be drawn regard- 
ing 7Z from this sample at such high redshifts, given that 
all criteria provide comparable 1Z results within the un- 
certainties. However, differences in completeness exist. 
D12 and KIM show the lowest values (C ~ 50 - 60%), 
KI and L07 the highest (C - 90 - 100%), while S05 gives 
intermediate values (C ~ 70%). 

One should also note the tendency for S05 and KIM to 
present the largest range between lower and upper pho- 
tometric limits. While in KIM this relates to the use of 
the less sensitive MIPS24 Aim band, the larger photomet- 
ric error ranges in S05 are due to the use of bands close 
in wavelength (Figure [2]) , instead of bands widely sepa- 



Note. — This table is restricted to the < z < 1 CDFS sample. 
While in the upper group of rows reliable photometry is required — a 
magnitude error below 0.36 — in if+IRAC bands, in the lower group 
of rows we also require reliable 24 (im photometry. The first row in 
each group refers to the total number of sources with reliable A'4-IRAC 
(upper group) and i-C+IRAC+24 /tm (bottom group) photometry. For 
reference, the value in parenthesis (with the corresponding Poisson er- 
ror) in the TZ column gives the overall fraction of identified AGN hosts, 
equivalent to the 1Z of a criterion selecting all sources in the field with 
reliable photometry in the considered bands. 

a Number of sources selected by a given criterion with an AGN/non- 
AGN classification from X-rays and/or spectroscopy. 

Number of selected sources with an AGN classification, from either 
the X-rays or optical spectroscopy. 

c Completeness (C) calculated as AGNsbl/AGNtot ■ Reliability (TV) 
calculated as AGNsel/Nsel- Numbers in parenthesis refer, respec- 
tively, to the Poisson error and the photometric upper and lower error 
bars. 



Table 3 

CDFS X-ray and Spectroscopic 1 < z < 2.5 control sample test. 



Sample 


Criterion 




AGN 


C 


K 


K+IRAC 


[none] 


535 


80 




(15±2) 




L07 


170 


50 


62(11,6) 


29(5,12) 




S05 


104 


28 


35(8,f) 


27(6,22) 




D12 


18 


15 


19(5,2) 


83(29,1^) 




KI 


60 


32 


40(8,1) 


53(12,lf) 


K+IKAC+ 


[none] 


324 


61 




(19±3) 


MIPS 2 4 M m 


L07 


110 


39 


64(13,1) 


35(7,1) 




S05 


70 


25 


41(10,1) 


36(8,ig) 




D12 


15 


11 


23(7,0) 


93(35,7) 




KI 


40 


26 


43(10|) 


65(16,? 6 ) 




KIM 


25 


16 


26(7,3) 


64(20,i|) 



Note. — This table is restricted to the 1 < z < 2.5 CDFS sample. 
Tabic structure and column definition as in Table [2] 

eral, the Poisson errors are larger than those that would 
be inferred from the photometric errors alone, so this ap- 
proach is also conservative in the sense of erring in favour 
of the larger uncertainty. The discussion and results pre- 
sented below are consequently based on a consideration 
of the Poisson errors associated with the sample sizes be- 
ing used in the analysis of Completeness and Reliability. 
In the low redshift bin (0 < z < 1), L07 is the most 
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Figure 8. The CDFS sources in KI (upper panels) and KIM 
(lower panels) color-color spaces, divided into low-z (z < 2.5, left 
panels) and high-2 (2.5 < z < 4, right panels) groups. Squares 
represent AGN hosts (as classified from X-rays or optical spec- 
troscopy), while dots highlight non-AGN sources. The dashed lines 
in the upper panels refer to the KI criterion, while the dashed lines 
in the lower panels refer to the IM region proposed in Section 13.21 
All sources displayed in the lower panels have K — [4.5] > 0, as 
required by the KIM criterion. 

rated in wavelength as in L07, KI and KIM. The large 
photometric errors found for the D12 criterion are prob- 
ably related to the small sample statistics (see COSMOS 
results below, where these errors are close to zero). 

Finally, Figure |8] details the application to the CDFS 
data of KI (upper panels) and KIM (lower panels), sep- 
arated into redshift bins (z < 2.5 and 2.5 < z < 4). For 
this exercise, we have required reliable photometry in 
the bands needed for KI or KIM. This figure shows that 
the KI and KIM criteria still miss a significant fraction 
(around two thirds) of the AGN population at z < 2.5. 
The high redshift panels show that most (around three 
quarters) of the overall population indeed falls inside the 
KI selection region (in agreement with Figure [3]). The 
low number of blue [8.0] — [24] sources is mainly a result of 
the photometric quality constraints applied to the sam- 
ple. Knowing that the MIPS24 /im band is also the less 
sensitive waveband, at high redshifts, where sources tend 
to be fainter, blue [8.0] — [24] sources will likely not be 
detected in the MIPS24 Mm band and consequently will 
be absent from this exercise. 

When no redshift estimate is available, the previous 
analysis, which separates the sample into redshift bins, 
would not be possible. Table [5] shows the results for the 
overall CDFS sample with no redshift constraint. The 
conclusions remain the same: KI and KIM are complete 
to ~50% and ~35% levels, respectively, and both show 
improved reliability (1Z ~ 50 — 70%) relative to L07 and 
S05 (KI and S05 error bars marginally overlap); and D12 
is less complete (with values comparable to KIM, C — 
25 - 30%), but the most reliable (U = 76 - 88%). 

4.1.2. KI/KIM efficiency m COSMOS 

The same comparison is now performed for COSMOS. 
No redshift segregation is applied as there is no classified 
SF system at z > 1.6 in this COSMOS sample. Among 



Table 5 

CDFS X-ray and Spectroscopic control sample test. 



Sample 


Criterion 




AGN 


C 


K 


tf+IRAC 


[none] 


1440 


166 




(12±1) 




L07 


377 


115 


69(8,|) 


31(3|) 




S05 


188 


73 


44(6j ) 


39(5, 22 ) 




D12 


55 


42 


25(4,|) 


76(16,2°) 




KI 


166 


84 


51(7,1) 


51(7,^) 


K+IRAC+ 


[none] 


834 


134 




(16±1) 


MIPS 2 4 M m 


L07 


236 


93 


69(9,|) 


39(5,4) 




S05 


125 


63 


47(7,|) 


50(8,1*) 




D12 


42 


37 


28(5,|) 


88(20,}?) 




KI 


101 


69 


51(8,1) 


68(11,|) 




KIM 


62 


46 


34(6,1) 


74(14,}0) 


Note. — No 


redshift cut 


is considered in this table. 


Tabic struc- 


ture and column definition 


is in Table [2] 










Table 


6 






COSMOS 


X-ray and 


Spectroscopic control sample test. 


Sample 


Criterion 




AGN 


C 


n 


K+IRAC 


[none] 


7216 


1423 




(20±1) 




L07 


2038 


1115 


78(3,1) 


55(2,1°) 




S05 


1102 


921 


65(3,f) 


84(4,? 3 ) 




D12 


586 


573 


40(2,|) 


98(6,0) 




KI 


967 


881 


62(3,|) 


91(4,1) 


K+IRAC+ 


[nonej 


2641 


842 




(32±1) 


MlPS 2 4 M m 


L07 


1087 


727 


86(4, 2 ) 


67(3,|) 




S05 


701 


629 


75(4, 2 ) 


90(5,1) 




D12 


464 


454 


54(3,1) 


98(6,0) 




KI 


643 


588 


70(4,1) 


91(5,2) 




KIM 


435 


398 


47(3,1) 


91(6,2) 



Note. — Table structure and column definition as in Table [2] 
No redshift range is adopted as there is no classified SF system at 
z > 1.6 in the COSMOS sample. 



the 7216 sources with either a spectral or X-ray clas- 
sification and adequate X-IRAC photometry, 1423 are 
flagged as AGN hosts by X-ray and spectroscopy crite- 
ria. There are 2641 sources with MIPS24^ m detection 
(842 AGN hosts) . Table [fj] reports the final statistics on 
the application of the various diagnostics. Note that 84% 
of this COSMOS sample is found at z < 1, implying that 
the statistics of this sample will be dominated by those 
of the z < 1 population. Hence, it is fair to compare 
these results with those found for the CDFS sample at 
< z < 1 (Table (2). The COSMOS sample nevertheless 
provides a more numerous sample, giving smaller Pois- 
son errors. The L07 criterion remains the most complete 
(C - 80%), but also the least reliable (C ~ 55 - 70%), as 
opposed to D12, which shows the highest reliability lev- 
els (1Z ~ 100%). KI marginally retrieves a more reliable 
AGN sample than S05, but comparable completenesses 
(C - 60 - 70% and K ~ 90%). KIM shows a reliability of 
~ 90%, but is likely the least complete criteria at z < 1. 
It is difficult to directly compare in absolute value the 
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results achieved with the CDFS and COSMOS samples, 
since many characteristics differ between the two sur- 
veys. As an example, by applying the COSMOS (IR and 
X-rays) flux limits to the CDFS sample, the C and TZ 
values for the different diagnostics are closer to those of 
COSMOS. Other issues may contribute to this, such as 
(a) different photometry extraction methods (ConvPhot 
by MUSIC team in CDFS and aperture photometry in 
COSMOS), (b) different spectral coverage depth and pro- 
cedures for spectral classification, (c) the different pho- 
ton indices used for the CDFS and COSMOS samples to 
convert from count rates to X-ray fluxes, (d) differences 
in relative sensitivity between soft and hard bands of the 
Chandra Space Telescope (in CDFS) and XMM-Newton 
(in COSMOS), and (e) cosmic variance. We stress, how- 
ever, that the results on the relative efficiency between 
the criteria are not contradictory between the two fields. 



4.2. IR-excess sources 

Also known as IR bright galaxies, IR-excess (IRxs) 
sources are believed to be part of an extreme IR popu- 
lation — the compton-thick (type-2) AGN — frequently 
missed by optical/X-ray surveys. IRxs sources are al- 
ways selected to have 24 n m to i?-band flux ratio of 
hi/ fa > 1000 (except in iDaddi et all 120071 who se- 
lect this type of sources based on a template fit pro- 
cedure). However, highly ob scured star-forming galaxies 
may a lso present such colors dDev et a l. 2008; Fiore et al.1 
120081: iPope et al.l 120081: INaravanan et al.l I2010D . It has 
been shown, nevertheless, that the AGN fraction in- 
creases as one restricts IRxs samples to brighter fluxes 
and/or redder NIR-to-optical colors. iDev et all (2008) 
show that at /24pm }t 0.6, ~ 50 — 80% of the IRxs sam- 
ple shows power-law S EDs (likely AGN-driven, but see 
INaravanan et al.l [20 10). and ~ 7 — 13% have detected 
X-ray counterparts (a hig her fraction than that found 
for non-IRxs sources). iDonlev et al.l (|2010T ) consider 13 
IRxs with /24 M m ^ 0.7, which show Spitzer-IRS spectra 
dominated by AGN emission (even though half shows 
detectable PAH emission), and their X-ray properties 
hint for heavily obscure d (Compton-th i ck) A GN activity 
in 80% of the sample. iPolletta et all 1)20081 ) considered 
/24/im ^ 1 IRxs sources, which also presented extremely 
red NIR-to-optical colors. All selected sources show the 
9.7 /jm silicate feature in absorption, have extreme MIR 
luminosities (L(6 /xm) ~ 10 46 ergs s _1 ), and always 
require the presence of torus emission to explain their 
MIR SED. Finally, a simp ler color combinat i on wa s ex- 
plored to fain ter fluxes by IFiore et al.l (|2008l . 12009ft and 
iTreister et all (pOOl ). Through means of X-ray stacking, 
the two groups showed that > 80% of IRxs sources with 
(R — A')vc ga > 4.5 show evidences for heavily obscured 
(Compton-thick) AGN activity. 

The diagnostics considered below rely on optical-to-IR 
color cuts, more precisely, R — K and R— [24]. However, 
i?-band photometry is not available in the MUSIC cata- 
log. We thus convert those colors to equivalent ones using 
i-band (i — K and i — [24] ) considering a powe r-law spec- 
trum (f v oc v a ). We highlight three criteria. IDev et all 
(|2008l D08) select sources with S 2 a/S r > 1000 and 
g 2 4 > 300 /Jy (equ ivalent to z - [24] > 7 and [24] < 17.5). 
IFiore et al.l (pool F08) select S 2 i/S R > 1000 sources 



Table 7 

Selection of IRxs sources. 



Region F08 D08 P08 







CDFS 






77 


10 


1 


L07 


72(94,15,|) 


9(90,41,J°) 


i (ioo,ioo,!J) 


S05 


29 (38,8, 2 2) 


5(50,27,? ) 


1 (100,100,g) 


D12 


18(23,6,|) 


5(50,27,g) 


i (ioo,ioo,g) 


KI 


40(52,10,J|) 


7(70,34,g) 


i (100,100,!]) 


KIM 


40(52,10,| 8 ) 


8(80,38,g) 


i (100,100,!!) 




COSMOS 






991 


256 


51 


L07 


909 (92,4,f 2 ) 


244(95,9,^) 


47 (92,19,2) 


S05 


381 (38,2,}f ) 


138 (54,6,|) 


39 (76,16,2.) 


D12 


202 (20,2,|) 


128(50,5,|) 


41 (80,17,2,) 


KI 


493(50,3,^) 


179(70,7,|) 


46 (90,18,|) 


KIM 


606 (61,3,11) 


212(83,8,|) 


50 (98,20,!!) 



Note. — Number of selected IRxs sources. The num- 
bers in parenthesis give the equivalent fractions, the Pois- 
son error, and the photometric upper and lower error bars. 



with R- K > 4.5 (i - [24] > 7 and i - K > 2.5 15 ) at 
S*24 > 40/iJy ([24] < 20). Finally, we also consider the 
bright est S^/Sr > 1 000 s ources by adopting the flux 
cut of IPolletta et al.l (|2008l P08), S 2 4 > lmJy (corre- 
sponding to [24] < 16.5). 

These criteria were applied to the MUSIC and COS- 
MOS catalogs and Table [7] details the numbers of the se- 
lected sources by each of the IR color criteria. In spite of 
the much smaller sample size in CDFS, the results agree 
remarkably well between the two surveys. In COSMOS, 
all criteria appear to select most of the brightest objects 
(P08) in a comparable way ( within the error bars), as 
expected (|Donlev et al.l 120081 : lEckart et all 120101 ) . Re- 
garding the fainter sources, L07 is the most complete, 
followed by KIM and KI, while S05 and D12 select the 
fewest IRxs sources. 

4.3. SDSS QSOs 

QSOs present in the Sloan Digital Sk y Survey Quasar 
Cata logue Data Release 7 (SDSS-DR7, iSchneider et al.l 
120171 were cross-matched (2" radius) with the Spitzer 
IR catalogues from the COSMOS (S-COSMOS), 
Lockman Hole, ELA IS-N1, and ELAIS-N2 (SWIRE, 
lLonsdale et all 12003ft fields using GATOR 16 at IRSA- 
NASA/IPAC. The final sample is composed by 293 
QSOs. A-band p hotometry comes fro m 2MASS (for 
21% of the s ample, ISkrutskie et al.ll2006ft . UKIDSS-DXS 
DR8 17 (29%.ILawrence et al.l l2007l). and COSMOS (23%, 
lllbert et al.ll2009ft . Overall, there are 186 QSOs with re- 
liable photometry in all IRAC channels. Of these, 140 

1 5 It should be noted, however, that the J — K color is likely 
more efficient than i — K in selecti ng AGN sources llMessias et al.l 
2010, sec also Glikman et al. 2004 for a combination of the two). 

16 http:/ /irsa. ipac.caltech.edu/applications/Gator/ 

17 UKIDSS uses th e UKIRT Wide Field Camera (WF- 
CAM; [ Casali et al. 20Q3) and a photometric system described in 
IHewett et al. (2006]). The pipeline processing and science archive 
are de scribed in Irwin et al. (in preparation) and IHamblv et al.l 
(2008). We have used data from the 8th data release. 
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Figure 9. The SDSS-DR7 QSOs found in the SWIRE and COS- 
MOS fields together with the Jiang ct al. (2006) sample displayed 
in KI (top left), IM (top right), L07 and D12 (bottom left), and 
S05 (bottom right) color-color spaces. Black dots represent z < 5 
sources (with small average photometric errors), and blue dots 
(with error bars) otherwise. 

have also MIPS24 Mm photometry, and 142 have i^-band 
photometry. We find 107 with full if-IRAC-MIPSa^m 
coverage. To enhance the high-z regime samp ling, we fur- 
ther include 13 SDSS-DR3 QSOs at z ~ 6 ('Jiang et all 
120061) . Of these, 12 are detected in all IRAC and 
MIPS24 /jm channels, while only five have 2MASS .fT-band 
data. 

Figure [9] shows the location of the QSO sample in the 
KI, IM (Section 13.21) . L07, and S05 color-color spaces. 
Only sources with reliable photometry in the displayed 
bands are shown. While L07, S05, and KI select most 
of the displayed sample (> 90%), D12 and KIM select 
a smaller, yet large, portion of it, respectively, 84% and 
65%. For z > 5 QSOs, the IM completeness drops to 
36%, in agreement with Figure [5l where QSO templates 
start to move out of the KIM region at z ~ 4 — 5. This 
is likely a result of both blue rest-frame optical SEDs, 
which most IRAC bands probe at these redshifts, low 
metallicity (consequently less dust) in these high redshift 
sources, and strong Ha emission. While blue rest- frame 
optical SEDs or strong Ha emission imply more objects 
in the blue side of [4.5] — [8.0], low metallicity results in 
some objects with bluer [8.0] — [24] colors. Note that S05 
and D12 retrieve opposite efficiencies in selecting z > 5 
QSOs, where S05 selects all and D12 selects none. 

We note, however, that if there is a prior indication 
for such high redshifts (z > 3 based on spectroscopy or, 
e.g., on the LBG technique), then the [8.0]- [24] color can 
be used by itself and much more efficiently for the iden- 
tification of AGN (cf. Figure For z > 5 QSOs, all 
but one show [8.0] - [24] > 1. The few QSOs with blue 
K — [4.5] (top left panel) colors are explained in light of 
the discussion in Section l3~Tl these are potentially AGN 
with smaller optical-to-IR flux ratios and/or sources pos- 
sessing strong line emission. 

The high completeness levels achieved with this optical 
selected sample show the eclectic selection of IR criteria. 
However, optically selected AGN are not the main targets 



Figure 10. The HzRG (z > 1) sample from lSevmour et al.l l|2007l ) 
displayed in the same color-color spaces as in Figure [9] Note the 
objects at 2 < [8.0] — [24] < 4 which are even redder than QSOs 
(Figure [9). Dots show the z < 3 population, crosses show the 
population at z > 3. The average photometric errors of this sample 
are shown on the bottom right of each panel for the z < 3 (left error 
bars) and z > 3 (right) samples. 

of IR AGN diagnostics, as, by definition, optical surveys 
do detect them. The most interesting use of these crite- 
ria is to recover sources undetected at X-ray and optical 
wavelengths. Sections 14.21 (above) and !4.4l (below) are, in 
this respect, much more representative of the usefulness 
of IR AGN diagnostics. 

4.4. High redshift Radio Galaxies 

To test yet another AGN population, we now consider 
High-z Radio Galaxies (HzRGs). These are among the 
most luminous sources in the Universe and are believed 
to ho st powerful AGN. We use the sample of 71 HzRGs 
from iSevmour et al.l (|2007l ) . These are all at z > 1, a 
redshift range where no normal galaxy is believed to con- 
taminate the AGN IM region proposed in Section 13.21 
This is a classic example — such as that of LBGs or 
spectroscopically confirmed galaxies at z > 1 — for the 
direct application of the IM boundaries. Having this, 
the K — [4.5] > color cut is not required to disentangle 
AGN/non-AGN dominated sources at z < 1 (see Sec- 
tion [231, meaning that one may consider [4.5] — [8.0] 
and [8.0] — [24] colors alone to determine whether AGN 
or stellar emission dominates the IR spectral regime. 

Figure [10] shows the location of 62 HzRGs in the IM 
color-color diagram (top right panel). Note the differ- 
ence to SDSS QSOs (Figure O, where HzRGs show pre- 
dominantly redder [8.0] — [24] colors. The AGN region 
correctly selects as AGN 88% (37 sources) of the sample 
with adequate photometry (42 sources detected at 4.5, 
8.0, and 24/xm). If no redshift estimate was available, 
however, one would need the K — [4.5] > color cut to 
apply the IM AGN criterion, i.e., the KIM criterion. The 
application of KIM would result in a 78% completeness 
level (28 out of 36 sources). The remaining panels show 
that L07 selects 86% (38 out of 44 sources), D12 selects 
27% (12 out of 44 sources), S05 selects 73% (32 out of 44 
sources), and KI selects 78% (32 out of 41 sources). 
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Again we emphasize that much of the improvement of 
KI/KIM over the commonly used L07 and S05 diagnos- 
tics is in terms of reliability, an indicator which is not be- 
ing evaluated with this sample nor those in Sections 14.21 
and[01 

4.5. Sub- Millimetre Galaxies 

In Mainline et all (|2011l ) one can find estimates for the 
non-stellar contribution (shown in that work to be re- 
lated to AGN activity) to the IR SED of Sub-Millimetre 
Galaxies (SMGs), showing that significant AGN emis- 
sion is present even at 1.6 /mi (~ 50 — 60% of the sample 
with an estimate shows AGN contributions of > 20—10% 
at these wavelengths). The estimates for the non-stellar 
contribution were obtained photometrically by fitting the 
SMGs SEDs with a combination of pure star-forming 
models and power-law spectra (/ cx v a , with a = — 2 
or —3). These non-stellar contribution estimates depend 
on whether one considers a constant star-formation his- 
tory or an initial instant burst followed by passive evolu- 
tion. These assumptions provide two extreme scenar- 
ios, and here we adopt the average between t he two. 
The sample is that from iChapman et al.l ()2005[ ). found 
to be at 0.1 < z spC c < 3.4 (~ 85% of the sample is at 
z spec > 1). The -FT-band photometry is presented in 
ISmail et al.l (EOQl, while the IRAC and MIPS photom- 
etry is presented in IHainline et al.l (|2009l ) . 

Figure [TT] shows the distribution of SMGs in differ- 
ent color-color spaces. The intensity of each data point 
relates to the AGN contribution at observed 8 /xm (the 
darker it is, the higher is the AGN contribution). Specif- 
ically in the IM plot (upper right panel), even though 
there are two sources with an estimated 0% contribution 
from AGN inside the selection region, all but one (with 
25%) of the remainder (19 sources) have > 50% of AGN 
contribution. Overall, the IM selected sample shows an 
average of 62+g% 18 of AGN contribution at observed 
8 /mi. L07, S05, D12, and KI show 19 , respectively, av- 
erages of 55±3%, 66±4%, 75±7%, and 60±l%. Although 
all criteria select at least one SMG with an estimated 
0% contribution from AGN, the average AGN contribu- 
tion is always > 55%. This means that for many SMGs, 
although the AGN does not dominate the IR emission, 
the AGN contributes significantly to the IR SED of the 
galaxy. The same is expected for IR selected AGN, this 
is, although we see selected sources where the AGN does 
not dominate at IR wavelengths, the AGN is present and 
active, contributing enough for the source to be selected 
as an IR AGN. 

5. IMPLICATIONS FOR JWST SURVEYS 

The start of scientific observations of JWST, the suc- 
cessor of Spitzer at mid-IR wavelengths, is expected for 
2018. It will be a 6.5 m space telescope which will operate 
from 0.6 to 27 /zm. As highlighted in this and previous 
work, this spectral regime has great potential for sepa- 
rating AGN from normal (non-AGN) galaxies. 

18 The errors only consider the propagated errors associated to 
each individual estimate used for averaging. 

19 Separately considering sources detected in bands involved in 
each criteria, hence subject to selection effects. For instance, 49 
SMGs are detected in all four IRAC channels (used for L07, S05, 
and D12 statistics), while 34 are detected in A"-band, 4.5 /an and 
8.0 fim (used for KI statistics). 
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Figure 11. The SMG sample from Chapman et al. (2005) dis- 
played in the same color-color spaces as in Figure [9] Only sources 
detected in the referred bands in each panel are displayed. The 
average photometric errors are shown on the bottom right of each 
panel. The data point intensity shows the AGN fraction at ob- 
served 8 ^m (intensity bar in the center). Sources with AGN con- 
tribution upper limits are shown as crosses. 

The sensitivity will of course be better than ever 
before, and the high-z universe will be probed with 
unprecedented detail. Many galaxies will be studied 
with MIR spectroscopy, and signs of AGN activit y will 
be naturally found that way (e.) 



jns oi auin activity will 
3.g .. lLaurent et alJ l2000t 
iLl l2009t IFu et all 1201(1) . 



lArmus et all 120071: iVeilleux et all 
When dealing with large surveys, however, with thou- 
sands of sources and many close to the detection limit, 
AGN selection will have to rely on photometric diagnos- 
tics such as the KI/KIM criteria presented here. By se- 
lecting AGN candidates over a broad range of redshifts, 
< z < 7, the KIM criterion will enable the study of 
AGN phenomena to the earliest epochs. 

While the KI/KIM criteria can already be applied 
to current data from Spitzer, potentially more efficient 
MIR criteria will be possible with the large wavelength 
coverage of the JWST. Using planned JWST filter re- 
sponse curves 20 , we suggest a possible and promising 
color-color space alternative to that proposed in Sec- 
tion [321 using the MIRI 10/im and 21/im filters instead 
of the IRAC 8.0/xm and MIPS 24itm bands, and the Nir- 
CAM 4.4/jm instead of IRAC 4.5/im (note that these 
are bands close to those used in Wide-field IR Survey 
Explo rer, WISE; see also lAssef eFall l2010t IStern et all 
2012). The selection conditions are the following: 



and 



[10] - [21] > -5.2 x ([4.4] - [10]) + 2.2 



[10] - [21] > 0.1 



In Figure O the four panels show that the [4.4]-[10] 
versus [10]-[21] color-color space seems to present a bet- 
ter selection of the AGN/Hybrid model tracks. The 



Provided online at: 



http://www.stsci.edu/jwst/instruments/nircam/instrumcntdcsign/filtcrs/indcx. 
http: / /www. stsci.edu/jwst/instruments/miri/ 
instrumcntdesign / miri_glance.html . 
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Figure 12. An alternative color-color space with JWST bands 
which might improve the AGN selection at < z < 7. Symbols 
and panels definition as in Figure [2] The light- / dark-grey error 
circles in the tracks result from larger colour gradients between 
redshift steps (Az = 0.01 each). These "circles" correspond to the 
error ellipse of each redshift point in the colour-tracks. 

AGN model tracks are better delineated by the selec- 
tion boundaries (recovering the z > 4 — 5 QSO missed by 
KIM) and, as a bonus, the 21/mi filter is over three times 
more sensitive than the planed MIRI 25/im filter (equiva- 
lent to the MIPS 24/im filter), increasing the probability 
of a detection needed for an AGN classification. This 
is shown in Figures [13] and [14] where AGN dominated 
sources are detected up to the highest redshift consid- 
ered in this work (z ~ 7). 

Finally, the improved resolution of JWST will allow the 
study of the nuclear region alone up to higher redshifts. 
This, of course, a llows the identification of lower lum i- 
nosity AGN (e.g.. iHonig etaHl20ToHAsmus et alJl20ll . 
thus overcoming the difficulty in selecting such systems 
in the IR. 

6. CONCLUSIONS 

Based on semi-empirical galaxy SED templates, we 
have developed physically motivated IR color criteria for 
the selection of a wide variety of AGN in a large redshift 
range (0 < z < 7). As well as the application to exist- 
ing data (e.g., the recently available W ISE data 21 at 3.4, 
4.6, 12, and 22 ^m. [Wright et aLll2010fl . these criteria are 
particularly relevant for the JWST, given the wide MIR 
spectral range considered. We thus propose new AGN 
IR diagnostics. The K — [4.5] color is ideal for the z < 1 
universe (Section 13. lft : KI is a reliable alternative to the 
IRAC-based diagnostics at z < 2.5 (based on Poisson er- 
rors alone, Sections [3.11 and |4~T|) ; and KIM (Section |3~2|) . 
a four band (K, 4.5, 8.0, and 24 fim), three color (K- 
[4.5], [4.5]- [8.0], and [8.0]- [24]) criterion is proposed as a 
new 'wedge' criterion to select AGN hosts over the full 
< z < 7 range, based on template colour tracks alone 
(the current control samples prevent any conclusion at 
z > 2.5 due to small sample statistics). 

The absolute efficiency of IR AGN criteria is, at this 

21 http:/ /wise2. ipac.caltech.edu/docs/release/prelim/ 
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Figure 13. SED flux evolution with redshift for two star- 
formation dominated systems (Arp220 and IRAS 22491-1908, up- 
per panels), and two AGN dominated systems (IRAS 19254- 
7245 and Mrk231, lower panels). The redshift steps are z = 
z , 0.5, 2.5, 7. The blue dots indicate the A~-IRAC-MIPS 2 4 um 
CDFS IOct total flux level (based in Table 1 of IWuvts eTaHOTa ) . 
red dots give the 10<r level (at equivalent CDFS integration times) 
of the JWST filters: 2.0 /he, 4.4 fim, 10 yum, 21 fim, and 25 fim. At 
longer wavelengths, the gap between Spitzer and JWST's sensitiv- 
ities is smaller due to the warmer telescope thermal background of 
JWST. 




Figure 14. Flux evolution with redshift for starbursts (blue solid 
line), hybrids (magenta dashed line), and AGN (black dotted line) 
in four JWST filters. Red dotted horizontal lines mark the IOct 
level (10,000 s integration) of each filter. 

stage, impossible to obtain as the control (X-ray and op- 
tical selected) samples used here are themselves incom- 
plete, hence, results should be treated with care. For in- 
stance, KI achieves reliabilities of - 50% in CDFS (~90% 
in COSMOS) and KIM shows reliabilities of -60-80% 
(~90%). These should be considered lower- limits, be- 
cause X-ray and spectroscopic data may not have been 
deep enough to reveal AGN features (Section H]), and we 
show that the selected sources with SF-dominated SEDs, 
are still likely to host an active AGN. 
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Figure 15. The left hand-side panel shows the dependency of the flux ratio on column density (Nh) and redshift. The numbers refer 
to the log(Njj[cm~ 2 ]) value corresponding to each line. The reference level log(Njj [cm - 2 ]) = 22 is shown as solid line. The right-hand 
panel shows the observed flux ratios for the CDFS and COSMOS samples. The log(N[j[cm" 2 ]) = 22 value is shown again as solid line for 
reference, as well as the log(Njj [cm - 2 ]) = 19 value, above which we have considered log(Nj] [cm - 2 ]) = and the observed photon index to 
estimate X-ray luminosities. 



Nevertheless, in the coming years, these criteria should 
be improved as a result of the rich variety of filters to be 
incorporated in the instruments on board the upcoming 
JWST (Section [5]). The ability to track AGN activity 
since the end of the reionization epoch will hold great 
advantages for the study of galaxy- AGN co-evolution. 
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APPENDIX 

It has been pointed out that the estimate of column densities (N h) based on soft-to-hard X-ray flux ratios overesti- 
mate s the type-2 (obscured) object fraction with increasing redshift (jUeda et al.ll2003HAkvlas et al.ll2006t iDonlev et al.l 
12012ft . The fact that we consider soft-to-full or hard-to-full band ratios does not change the scenario. This bias effect 
happens due to a harder spectral range being probed at higher redshifts by the observed 0.5-8/10keV spectral window 
plus the error associated with the flux measurement. Figure 115b shows the soft-to-full ratio variation depending on 
redshift and Nh- One can see that with increasing redshifts, for the smallest Nh values, the associated flux ratios 
become problematically degenerate 22 . Would there be perfect precision and accuracy in the flux measurements, this 
would hold no problem, however, there is a flux error which scatters the real values producing the observational bias. 
Figure [15b shows the observed values from the CDFS and COSMOS samples for reference. 

Figure [TSa shows the overall bias toward type-2 objects over type-1 objects depending on redshift and flux ratio error 
(in percent). In order to produce this figure, we have considered an initial 50/50% fraction of type-l/type-2 objects and 
a Gaussian probability density distribution (PDD) centred at flux ratios corresponding to 19 < log(NH[cm -2 ]) < 25 
and with a a = 5, 10, 20, 30% of the flux ratio. The Gaussian PDD gives the probability for the output (observed) 
classification to be unobscured (log(NH[cm~ 2 ]) < 22) or obscured (log(NH[cm~ 2 ]) > 22), consequently providing the 
overall output bias shown in Figure fTBk . In this figure, the bias is calculated as the ratio between the output (observed) 
type-2/type-l ratio and the input (intrinsic) type-2/type-l ratio. Note the trends maximize at ~ 2. If we instead had 
restricted this exercise to the 20 < log(NHfc m~ 2 1) < 24 range (co nsidered in lDonlev et al.ll2012ft . the maximum value 
would be at ~ 1.8. This is the bias found bv lDonlev et al.l (|2012f ) at z ~ 3. This means, for the z ~ 3 XMM sources 
used in that work, the flux ratios are affected on average by a ~ 30% error. 

The Gaussian PDD also provides the average overestimate associated with a given intrinsic Nh value depending on 

22 At z ~ 0, there is also a degeneracy at log(Nji[cm~ 2 ]) = 23 — 25, but it does not represent a problem, because the constraint considered 
to distinguish type-1 from type-2 objects is log(NH[cm — 2 ]) = 22. 
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Figure 16. The left hand-side panel (a) shows the bias toward type-2 objects evolution with redshift for different levels of error in the flux 
ratio estimate. The right hand-side panel (b) shows, for a 20% error level in the flux ratio estimate, the average overestimate depending 
on column density (Nh) and redshift. 

flux ratio error and redshift. This is shown in Figure ITSb for a 20% flux ratio error. These trends allow us to attempt 
the correction of the referred bias, where the observed Nh value (y-axis in Figure \Wb) corresponds to a intrinsically 
smaller Nh value (x-axis) for a given flux ratio error at the source's redshift. The flux ratios used to estimate Nh in 
this work are all based on measurements with errors less than a third of the fluxes, a propagated flux ratio error of 
< 47% (the average accepted flux ratio error was ~ 19%). Even though individually the corrected value should not 
be regarded as the best estimate, this average correction applied to the overall population will diminish the bias. The 
number of type-1 objects which would be classified as type-2 if no correction was applied is small: 14 in CDFS and 9 
in COSMOS. Most of these (19 objects) are at z > 3. 



REFERENCES 

Adelberger, K. L., Steidel, C. C, Shapley, A. E., et al. 2004, ApJ, 
607, 226 

Afonso, J., Mobasher, B., Chan, B. & Cram, L. 2001, ApJ, 559, 
101 

Afonso, J., Mobasher, B., Koekemoer, A., Norris, R. P., & Cram, 

L. 2006, AJ, 131, 1216 
Akylas, A., Georgantopoulos, I., Georgakakis, A., Kitsionas, S., & 

Hatziminaoglou, E. 2006, A&A, 459, 693 
Alexander, D. M., Bauer, F. E., Chapman, S. C, Smail, I., Blain, 

A. W., Brandt, W. N., & Ivison, R. J. 2005, ApJ, 632, 736 
Alonso-Herrero, A., et al. 2006, ApJ, 640, 167 

Armus, L., Charmandaris, V., Bernard-Salas, J., et al. 2007, ApJ, 
656, 148 

Asmus, P., Gandhi, P., S mette, A., Honig, S. F., & Duschl, W. J. 

2011. larXiv:1109.4873l 
Assef, R. J., Kochanek, C. S., Brodwin, M., et al. 2010, ApJ, 713, 

970 

Assef, R. J., Kochanek, C. S., Ashby, M. L. N., et al. 2011, ApJ, 
728, 56 

Barmby, P., et al. 2006, ApJ, 642, 126 
Bongiorno, A., et al. 2010, A&A, 510, 56 

Bower, R. G, Benson, A. J., Malbon, R., et al. 2006, MNRAS, 
370, 645 

Brand, K., et al. 2006, ApJ, 644, 143 
Brusa, M., et al. 2010, ApJ, 716, 348 

Calzetti, D., Kennicutt, R. C, Engelbracht, C. W., et al. 2007, 

ApJ, 666, 870 
Cappelluti, N., et al. 2009, A&A, 497, 635 
Cardamone, C. N., et al. 2008, ApJ, 680, 130 

Casali, M., Adamson, A., Alves de Oliveira, C, et al. 2007, A&A, 
467, 777 

Chambers, K. C, Miley, G. K., van Breugel, W. J. M., & Huang, 

J.-S. 1996, ApJS, 106, 215 
Chapman, S. C, Blain, A. W., Smail, I., & Ivison, R. J. 2005, 

ApJ, 622, 772 



Comastri, A., Fiore, F., Vignali, C, et al. 2001, MNRAS, 327, 781 
Croton, D. J., Springel, V., White, S. D. M., et al. 2006, MNRAS, 
365, 11 

Daddi, E., et al. 2007, ApJ, 670, 156 
Dey, A., et al. 2008, ApJ, 708, 943 

de Grijp, M. H. K., Miley, G. K., Lub, J., & de Jong, T. 1985, 

Nature, 314, 240 
D onley, J. L., Koek emoer, A. M., Brusa, M., et al. 2012, 

larXiv:1201.3899l 
Donley, J. L., Rieke, G. H., Alexander, D. M., Egami, E., & 

Perez-Gonzalez, P.G 2010, ApJ, 719, 1393 
Donley, J. L., et al. 2008, ApJ, 687, 111 
Donley, J. L., et al. 2007, ApJ, 660, 167 
Draper, A. R., & Ballantyne, D. R. 2009, ApJ, 707, 778 
Eckart, M., McGreer, I. D., Stern, D., Harrison, F. A., & Helfand, 

D. J. 2010, ApJ, 708, 584 
Eckart, M. E., Stern, D., Helfand, D. J., et al. 2006, ApJS, 165, 19 
Edelson, R., & Malkan, M. 2012, ApJ, 751, 52 
Elbaz, D., Flores, H., Chanial, P., et al. 2002, A&A, 381, LI 
Elbaz, D., Dickinson, M., Hwang, H. S., et al. 2011, A&A, 533, 

A119 

Elvis, M., Wilkes, B. J., McDowell, J. C, et al. 1994, ApJS, 95, 1 
Engelbracht, C. W., Rieke, G. H., Gordon, K. D., et al. 2008, 
ApJ, 678, 804 

Fadda, D., Yan, L., Lagache, G, et al. 2010, ApJ, 719, 425 
Farrah, D., Bernard-Salas, J., Spoon, H. W. W., et al. 2007, ApJ, 
667, 149 

Fazio, G. G, Hora, J. L., Allen, L. E., et al. 2004, ApJS, 154, 1 
Fiore, F., et al. 2008, ApJ, 672, 94 

Fiore, F., Puccetti, S., Brusa, M., et al. 2009, ApJ, 693, 447 
Fu, H., Yan, L., Scoville, N. Z., et al. 2010, ApJ, 722, 653 
Garn, T., et al. 2009, MNRAS, tmp, 1909 

Georgakakis, A., Rowan-Robinson, M., Nandra, K., et al. 2010, 

MNRAS, 406, 420 
Giacconi, R., Rosati, P., Tozzi, P., et al. 2001, ApJ, 551, 624 
Gilli, R. 2004, Advances in Space Research, 34, 2470 



17 



Glikman, E., Gregg, M. D., Lacy, M., et al. 2004, ApJ, 607, 60 
Granato, G. L., De Zotti, G., Silva, L., Bressan, A., & Danese, L. 

2004, ApJ, 600, 580 
Grazian, A., et al. 2006, A&A, 449, 951 

Hainline, L. J., Blain, A. W., Smail, I., et al. 2009, ApJ, 699, 1610 
Hainline, L. J., Blain, A. W., Smail, I., et al. 2011, ApJ, 740, 96 
Hambly, N. C, Collins, R. S., Cross, N. J. G., et al. 2008, 

MNRAS, 384, 637 
Hatziminaoglou, E., et al. 2005, AJ, 129, 1198 
Henriques, B., Maraston, C, Monaco, P., et al. 2011, MNRAS, 

415, 3571 

Hewett, P. C, Warren, S. J., Leggett, S. K., & Hodgkin, S. T. 

2006, MNRAS, 367, 454 

Ho, L. C, Terashima, Y., & Ulvestad, J. S. 2003, ApJ, 589, 783 
Honig, S. F., Kishimoto, M., Gandhi, P., et al. 2010, A&A, 515, 
A23 

Hopkins, P. F., Hernquist, L., Martini, P., et al. 2005, ApJ, 625, 
L71 

Hopkins, P. F., Hernquist, L., Cox, T. J., et al. 2006, ApJS, 163, 1 

Hu, E. M., & Ridgway, S. E. 1994, AJ, 107, 1303 

Hunt, L. K., Thuan, T. X., Izotov, Y. I., & Sauvage, M. 2010, 

ApJ, 712, 164 
Ilbert, O., et al. 2009, ApJ, 690, 1236 

Ivezic, Z., Menou, K., Knapp, G. R., et al. 2002, AJ, 124, 2364 

Ivison, R. J., et al. 2004, ApJS, 154, 124 

Jiang, L., et al. 2006, AJ, 132, 2127 

Lacy, M., et al. 2004, ApJS, 154, 166 

Lacy, M., Petric, A. O., Sajina, A., Canalizo, G., 

Storrie-Lombardi, L.J., Armus, L., Fadda, D., & Marleau, F. R. 

2007, AJ, 133, 186 

Laor, A. 1999, Phys. Rep., 311, 451 

Laurent, O., Mirabel, I. F., Charmandaris, V., Gallais, P., 

Madden, S. C, Sauvage, M., Vigroux, L., & Cesarsky, C. 2000, 
ApJS, 359, 887 

Lawrence, A., Warren, S. J., Almaini, O., et al. 2007, MNRAS, 
379, 1599 

Le Floc'h, E., et al. 2009, ApJ, 703, 222 
Lonsdale, C. J., et al. 2003, PASP, 115, 897 

Loewenstein, M., Mushotzky, R. F., Angelini, L., Arnaud, K. A., 

& Quataert, E. 2001, ApJ, 555, L21 
Luo, B., et al. 2008, ApJS, 179, 19 
Luo, B., et al. 2010, ApJS, 187, 560 
Maiolino, R., et al. 2003, MNRAS, 344, 59 
Maraston, C. 2005, MNRAS, 362, 799 

Martinez-Sansigre, A., Rawlings, S., Lacy, M., et al. 2005, Nature, 
436, 666 

McCracken, H. J., Capak, P., Salvato, M., et al. 2010, ApJ, 708, 
202 

Melia, F., & Falcke, H. 2001,ARA&A, 39, 309 

Messias, H., Afonso, J., Hopkins, A., Dominici, T., Mobasher, B., 

& Alexander, D. M. 2010, ApJ, 719, 790 
Miley, G. K., Neugebauer, G., & Soifer, B. T. 198 5, ApJ, 293, Lll 
Morganti, R., Holt, J., Tadhunter, C, et al. 2011, arXiv: 1109.0630 
Narayanan, D., Dey, A., Hayward, C. C, et al. 2010, MNRAS, 

407, 1701 

Neugebauer, G, Oke, J. B., Becklin, E. E., & Matthews, K. 1979, 
ApJ, 230, 79 

Neugebauer, G, Habing, H. J., van Duinen, R., et al. 1984, ApJ, 
278, LI 

Norris, R. P., Afonso, J., Cava, A., et al. 2011, ApJ, 736, 55 
Papovich, C, Moustakas, L. A., Dickinson, M., et al. 2006, ApJ, 
640, 92 

Papovich, C, Rudnick, G, Le Floc'h, E., et al. 2007, ApJ, 668, 45 
Patten, B. M., Stauffer, J. R., Burrows, A., et al. 2006, ApJ, 651, 
502 

Petric, A. O., Armus, L., Howell, J., et al. 2011, ApJ, 730, 28 



Pier, E. A. & Krolik, J. H. 1992, ApJ, 401, 99 

Pollctta, M., Weedman, D., Honig, S., Lonsdale, C. J., Smith, H. 

E., & Houck, J. 2008, ApJ, 675, 960 
Polletta, M., et al. 2007, ApJ, 663, 81 
Polletta, M., et al. 2006, ApJ, 642, 673 

Pope, A., et al. 2008, ApJ, 675, 1171 

Pozzi, F., Vignali, C, Gruppioni, C, et al. 2012. larXiv:1204.1152l 
Richards, G. T., Fan, X., Newberg, H. J., et al. 2002, AJ, 123, 
2945 

Richards, G. T., et al. 2009, AJ, 137, 3884 
Rieke, G. H., et al. 2009, ApJ, 692, 556 

Rigby, J. R., Marcillac, D., Egami, E., et al. 2008, ApJ, 675, 262 
Roche, N. D., Dunlop, J., & Almaini, O. 2003, MNRAS, 346, 803 
Sajina, A., Lacy, M., & Scott, D. 2005, ApJ, 621, 256 
Salvato, M., et al. 2009, ApJ, 690, 1250 

Salvato, M., Ilbert, O., Hasinger, G., et al. 2011, ApJ, 742, 61 

Sanders, D. B., et al. 2007, ApJS, 172, 8 

Sanders, D. B. 1999, ASP, IAU Symp. 194, 25 

Sanders, D. B., Phinney, E. S., Neugebauer, G, Soifer, B. T., & 

Matthews, K. 1989, ApJ, 3 47, 29 

Sani, E., & Nardini, E. 2011. larXiv:1112.5170l 
Santini, P., et al. 2009, A&A, 504, 751 
Schneider, D. P., et al. 2010, AJ, 139, 2360 
Scoville, N., et al. 2007, ApJS, 172, 1 
Seymour, N., et al. 2007, ApJS, 171, 353 

Simpson, C, Martinez-Sansigre, A., Rawlings, S., et al. 2006, 

MNRAS, 372, 741 
Skrutskie, M. F., Cutri, R. M., Stiening, R., et al. 2006, AJ, 131, 

1163 

Smail, I., Chapman, S. C, Blain, A. W., &; Ivison, R. J. 2004, 
ApJ, 616, 71 

Somerville, R. S., Hopkins, P. F., Cox, T. J., Robertson, B. E., & 

Hernquist, L. 2008, MNRAS, 391, 481 
Springel, V., Di Matteo, T., & Hernquist, L. 2005, MNRAS, 361, 

776 

Steidel, C. C, Adelberger, K. L., Shaplcy, A. E., et al. 2003, ApJ, 
592, 728 

Steidel, C. C, Shapley, A. E., Pettini, M., et al. 2004, ApJ, 604, 
534 

Stern, D., et al. 2005, ApJ, 631, 163 

Stern, D., Assef, R. J., Benford, D. J., et al. 2012. larXivTT2 05.0811 
Szokoly, G. P., et al. 2004, ApJS, 155, 271 

Tielens, A. G. G. M., Miley, G. K., & Willis, A. G. 1979, A&AS, 
35, 153 

Tozzi, P., et al. 2006, A&A, 451, 457 

Treister, E., Cardamone, C. N., Schawinski, K., et al. 2009, ApJ, 
706, 535 

Treister, E., et al. 2006, ApJ, 640, 603 
Treister, E., & Urry, C. M. 2005, ApJ, 630, 115 
Treister, E., & Urry, C. M. 2011, larXiv:1112.0320l 
Ueda, Y., Akiyama, M., Ohta, K., & Miyaji, T. 2003, ApJ, 598, 
886 

Vandame, B. 2002, Proc. SPIE, 4847, 123 

Veilleux, S., Rupke, D. S. N., Kim, D.-C, et al. 2009, ApJS, 182, 
628 

Werner, M. W., Roellig, T. L., Low, F. J., et al. 2004, ApJS, 154, 
1 

Worsley, M. A., Fabian, A. C, Barcons, X., et al. 2004, MNRAS, 
352, L28 

Worsley, M. A., Fabian, A. C, Bauer, F. E., et al. 2005, MNRAS, 
357, 1281 

Wright, E. L., Eisenhardt, P. R. M., Mainzer, A. K., et al. 2010, 

AJ, 140, 1868 
Wuyts, S., et al. 2008, ApJ, 682, 985 



